Traffic Light Control by Multiagent Reinforcement Learning Systems
نویسندگان
چکیده
Traffic light control is one of the main means of controlling road traffic. Improving traffic control is important because it can lead to higher traffic throughput and reduced congestion. This chapter describes multiagent reinforcement learning techniques for automatic optimization of traffic light controllers. Such techniques are attractive because they can automatically discover efficient control strategies for complex tasks, such as traffic control, for which it is hard or impossible to compute optimal solutions directly and hard to develop hand-coded solutions. First the general multi-agent reinforcement learning framework is described that is used to control traffic lights in this work. In this framework, multiple local controllers (agents) are each responsible for the optimization of traffic lights around a single traffic junction, making use of locally perceived traffic state information (sensed cars on the road), a learned probabilistic model of car behavior, and a learned value function which indicates how traffic light decisions affect long-term utility, in terms of the average waiting time of cars. Next, three extensions are described which improve upon the basic framework in various ways: agents (traffic junction controllers) taking into account congestion information from neighboring agents; handling partial observability of traffic states; and coordinating the behavior of multiple agents by coordination graphs and the max-plus algorithm. Bram Bakker Informatics Institute, University of Amsterdam, Science Park 107, 1098 XG Amsterdam, The Netherlands, e-mail: [email protected] Shimon Whiteson Informatics Institute, University of Amsterdam, Science Park 107, 1098 XG Amsterdam, The Netherlands e-mail: [email protected] Leon J.H.M. Kester Integrated Systems, TNO Defense, Safety and Security, Oude Waalsdorperweg 63, 2597 AK Den Haag, The Netherlands e-mail: [email protected] Frans C.A. Groen Informatics Institute, University of Amsterdam, Science Park 107, 1098 XG Amsterdam, The Netherlands e-mail: [email protected]
منابع مشابه
JADE, TraSMAPI and SUMO: A tool-chain for simulating traffic light control
Increased stress, fuel consumption, air pollution, accidents and delays are some of the consequences of traffic congestion usually incurring in tremendous economic impacts, which society aims to remedy in order to leverage a sustainable development. Recently, unconventional means for modeling and controlling such complex traffic systems relying on multiagent systems have arisen. This paper cont...
متن کاملCooperative, hybrid agent architecture for real-time traffic signal control
This paper presents a new hybrid, synergistic approach in applying computational intelligence concepts to implement a cooperative, hierarchical, multiagent system for real-time traffic signal control of a complex traffic network. The large-scale traffic signal control problem is divided into various subproblems, and each subproblem is handled by an intelligent agent with fuzzy neural decision-m...
متن کاملMultiagent Reinforcement Learning and Coordination for Urban Traffic Control using Coordination Graphs and Max-plus Master’s Thesis Artificial Intelligence Specialization Intelligent Systems
متن کامل
Multiagent Reinforcement Learning for Urban Traffic Control Using Coordination Graphs
Since traffic jams are ubiquitous in the modern world, optimizing the behavior of traffic lights for efficient traffic flow is a critically important goal. Though most current traffic lights use simple heuristic protocols, more efficient controllers can be discovered automatically via multiagent reinforcement learning, where each agent controls a single traffic light. However, in previous work ...
متن کاملA Multiagent Reinforcement Learning algorithm to solve the Community Detection Problem
Community detection is a challenging optimization problem that consists of searching for communities that belong to a network under the assumption that the nodes of the same community share properties that enable the detection of new characteristics or functional relationships in the network. Although there are many algorithms developed for community detection, most of them are unsuitable when ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010